Network Evolution Based on Centrality 
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We study the evolution of networks when the creation and decay of links are based on the position 
of nodes in the network measured by their centrality. We show that the same network dynamics 
arises under various centrality measures, and solve analytically the network evolution. During the 
complete evolution, the network is characterized by nestedness: the neighborhood of a node is 
contained in the neighborhood of the nodes with larger degree. We find a sharp transition between 
hierarchical and homogeneous networks, depending on the rate of link decay. We also show that 
this evolution mechanism may lead to double power-law degree distributions, with interrelated 
exponents. 
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The underlying mechanisms of link formation govern- 
ing the evolution of a network ultimately determine its 
emergent properties at the aggregate level U, [l] . In par- 
ticular, there exists numerous empirical evidences that 
the network evolution can be driven by centrality, where 
nodes with higher centrality are more likely to form or re- 
ceive links The notion of centrality was recognized 
to play a fundamental role in the most despair fields, 
ranging from dynamical systems Q , synchronization Q , 
biology , and economics H, Q ■ In spite of its impor- 
tance, a formal understanding of how networks evolve 
when the formation of links depends on the centrality of 
the nodes involved is still missing. 

Depending on the context, several measures of central- 
ity have been introduced to quantify the central position 
of a node in a network: degree, eigenvector, hetween- 
ness, closeness, PageRank and Bonacich centrality are 
the most prominent ones 0, . Due to this variety, few 
attempts have been made so far to elucidate the common 
features underlying the emergent properties of networks 
evolving by centrality [loj . 

Going from the microlevel of link formation to the 
macroscopic level, some real world networks exhibit a 
high degree of clustering while, coincidentally, their de- 
gree distributions show power-law tails. Taken together, 
these two characteristics indicate a hierarchical organi- 
zation in the network 11 1. In social and economic [l^ . 
as well as biological systems [3], it has been found that 
the hierarchical organization of networks can further be 



characterized by nestedness |13l . Il4l |: the neighborhood 
of a node is contained in the neighborhood of the nodes 
with higher degrees. In these examples, the degree of 
nestedness (defined as the fraction of links belonging to 
the nested structure) was shown to be above 93% [14 1. 

In this Letter, we study a model of network evolu- 
tion where links are created or removed based on the 
centrality of the nodes incident to the links. We show 
that the network evolution is independent of the central- 
ity measure used. Thus, for the first time, this model 
provides a general framework to study the evolution of 



networks under various measures of centrality. In this 
model, there exist stationary networks which are highly 
hierarchical when the rate of link creation is low. More- 
over, the networks are nested during the complete evo- 
lution. As we show, both, a hierarchical organization 
as well as network nestedness can be the outcomes of 
a centrality based network formation process. Finally, 
we show that in this framework, double power-law de- 
gree distributions [l^ [l^ can be stationary solutions, 
and that each power-law exponent unequivocally deter- 
mines the other. 

Model. We consider a network composed of N nodes, 
initially connected by an arbitrary network. Each node 
has a centrality associated to it. Note that it is not spec- 
ified which centrality measure this refers to. There is a 
single parameter a € [0, 1] that determines the relative 
weight between edge addition and deletion. At a rate 
-that is equal for all nodes- a node is randomly selected 
and modifies its neighborhood: with probability a, it cre- 
ates a link to the node with the highest centrality it is not 
already connected to. Otherwise, with probability 1 — a, 
a link of the selected node decays. If this happens then 
the node removes the link to the neighbor with the lowest 
centrality. If the node is connected to all the other nodes 
in the network (resp. it is isolated), and it has to create 
(resp. remove) a link, nothing happens. It is simple to 
show that the network formation process is ergodic, and 
thus starting from any initial network yields the same 
asymptotic results. Thus, and without any loss of gener- 
ality, in the following we consider an empty network as 
initial condition. 

Alternatively, we could assume that a node has only 
local information of the network 17| and creates a link 
to another one in its second-order neighborhood with the 
highest centrality. It turns out that this leads to the same 
network evolution process as the one described above. 
This makes sense in situations where centrality is known 
ex ante, for example, when centrality is a measure of 
performance in inter-organizational networks [l8l |. or it 
indicates the fitness of biological species Q • 
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Different measures of centrality can be used in the 
above network formation process, depending on the con- 
text. For example, consider a population of biological 
species in a catalytic network 0- Let species i = 1, . . . , N 
have a fitness value yi > that evolves according to the 



dynamics i/i = J2 



N 



{0, 1} is the ij-th element of the symmetric adjacency 
matrix A. In terms of relative fitness Xi = yi/'Ylfj=i Ujj 

we get Xi = Y,j'=i o-ij^j ~ Hk,j=i o-kjXj. This dynam- 
ics have a fixed point given by the eigenvector v > cor- 
responding to the largest real (Perron-Frobenius) eigen- 
value ApF of A. Hence, in this model, fitness is directly 
given by the eigenvector centrality. A second example 
comes from a socio-economic context. Consider a set of 
agents whose payoffs are interdependent in a network. 
The agents choose a contribution level > and rc- 



> 0, where 



ceive a payoff given by tt^ 



</2 + AE 



N 



with A < 1/ApF [19[. Then the unique Nash equilib- 
rium is given by their Bonacich centrality Q. The dy- 
namics introduced above correspond to a game in which 
agents form links that maximize their equilibrium pay- 
offs in each period [l^l . Further examples include degree 
centrality, closeness centrality betweenness central- 
ity 01 , PageRank and random walk centrality 21 j . Fi- 
nally, links created (removed) are the ones which increase 
the most (decrease the least) the largest eigenvalue App. 
These links were shown to modify the most the dynami- 
cal properties of the system (22l |. 

At every time step, this dynamics yield a network 
whose adjacency matrix A is stepwise: the nodes can be 
ordered by their degree, such that the zero/one entries 
in the adjacency matrix are separated by a monotonic 
step-function h{x) (see Fig. [1] right), where x = 1 — r, 
and r is the degree rank of a node. Networks with step- 
wise adjacency matrix are threshold networks 23|, |2^. In 
such a network, if two nodes i and j have degrees such 
that di < dj, then their neighborhoods satisfy Ni C Nj. 
Thus, these networks are characterized by nestedness. 
Moreover, the nodes can be partitioned into a dominat- 
ing set and independent sets. In the dominating set 5, 
every node not in S is linked to at least one member of 
S. Conversely, an independent set is one in which no two 
nodes are adjacent (see Fig. [U left). 

We now prove by induction that the adjacency matrix 
representing the state of the network at every time step 
is stepwise for the case of eigenvector centrality. First, 
at time i = 0, the first link added generates a (trivial) 
stepwise matrix. Next, let us assume that this is true at 
time t > 0. Consider the creation of a link ij. Then Vi = 

1/ApF X]fc=i "i/f^fe = ^/^PFj2keNi'"k- Thus, the larger 
is the degree of a node i, the higher is its eigenvector 
component w,;. In this way, the eigenvector centrality 
of the nodes is ranked in the same way as their degree. 
Therefore, for the model studied, a node has to establish 
a link to a node with the highest degree it is still not 
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FIG. 1: Representation of a nested network (left) and the 
associated stepwise adjacency matrix (right) with A*" = 10 
nodes. A nested network can be partitioned into subsets of 
nodes with the same degree (each subset is represented by 
circle, next to which the degree d of the nodes in the subset 
is indicated). A Une connecting two subsets indicates that 
there exists a link between each node in one set to all nodes 
in the other set. The union of the sets represented by the 
circles to the left of the dashed line induce a dominating set, 
while to the right the circles indicate independent sets. In the 
matrix A to the right, the zero-entries are separated from the 
one-entries by a step-function. 



connected to. This preserves the stepwise property of A 
(see Fig.[TJ right). Similarly, for the removal of links, the 
node with the lowest degree among the neighbors is the 
least central one, and removing a link to it preserves the 
stepwise property of A. 

The nested neighborhood structure allows us to use 
similar arguments for other centrality measures. Con- 
sider two nodes i and j in a nested graph with di > dj. 
All walks starting at node j are contained in the set of all 
walks starting from node i (after exchanging the starting 
node j with i). This implies that i has a higher cen- 
trality than j for any centrality measure that is based 
on walks or paths in the network. Hence, a proof by 
induction shows that the ranking of nodes by degree is 
equivalent to the ranking by centrality for this family of 
centrality measures. In general, this dynamics leads to a 
self-reinforcement of the nested structure. 

Given the symmetry of the adjacency matrix A, in 
order to solve the dynamic evolution of the network, it is 
enough to solve the dynamics for the nodes belonging to 
the independent sets (sec Fig.[T]). Let us denote by n{d, t) 
the number of nodes in the independent sets with degree 
d at time t. The dynamic evolution of these populations 
can be written as a rate equation. 



dtn{d, t) 



Lo[d + \ d] n{d 
Lo[d — 1 — >■ d) n{d 
{uj[d^d - 1] 



l,i) + 
l,t)- 
uj[d — s> d + 1]) n{d, t), 



(1) 



where the transition rates are simply Lu[d ^ d+l] = a/N, 
Lu[d d - 1] = (1 - a)/N. In Eq. (P we have ne- 
glected the contributions of the nodes in the correspond- 
ing dominating set (which are selected with probability 
0{N~^)) to the dynamics of the nodes in the indepen- 
dent sets. The dynamics studied is restricted to the pro- 
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file separating (non-)existing edges, and is thus related 
to surface-growth models, such as those of polynuclear 
growth; then, it can also be linked to the one-dimensional 
Ising model with Kawasaki dynamics [25 1 . 

The dynamic evolution of the network can be written 
in terms of its degree distribution P{k;t) = n{d,t)/N, 
where k = d/N denotes the normalized degree. For a 
finite population, the minimum increment possible in de- 
gree is Sk ~ 1/N. At leading order in 5k, the dynamic 
evolution corresponding to Eq. ([T} is given by 



dtP{k;t) = {l-2a)dkP{k-t) 

+ 6kdlkP{k-t) + 0{Sk^), 

dtP{0; i) = (1 - 2a) {Sk + dkP{0; t) 
-~a5kP{Q;t)) + 0{6k^), 



(2) 



with the additional boundary condition P(l, t) — 0{6k^) 
and an initial condition P{k, 0) = 6{k). 

When the terms of order Sk can be neglected, Eq. ([2]) 
becomes a usual drift equation whose stationary solution 
is either a complete network for a > 1/2 (when the link 
decay is low), or empty for a < 1/2. If a is small (and 
the link decay is high) , the network rapidly converges to 
a hierarchical structure, where only a few nodes imme- 
diately become central, and they remain in this central 
position during the network evolution. In this case it 
is the competition driven dynamics for ccntrality which 
leads to the spontaneous emergence of hubs . 

There exists a first order phase transition in the net- 
work density that gives rise to nontrivial effects around 
the critical point a = 1/2. If |1 - 2a\/Sk ~ 0(1), then 
the diffusion term in Eq. ([2|) is not negligible anymore. 
Time scales must be rescaled to r = tSk, and we get the 
Fokkcr-Planck equations 

drP{k;T) = {l~2a)dkP{k-T) + dlkP{k-T) (3) 
(1 - 2a) 



drP{Q\T) 



Sk 



-dkP{Q\T) 



(4) 



This prescription allows to relate the width of the tran- 
sition from sparse to dense networks: it must be that 
|1 - 2a\ ~ Oil), or conversely, Aa ~ N^^. 

We now study the stationary solutions for all values of 
a G [0, 1]. First, notice that the network obtained for a 
value of a > 1 /2 is the complement of the network ob- 
tained for 1 — a < 1/2. Thus, in the following we consider 
only values of a < 1/2. The step-function h{x) can be de- 
composed in a part hu{x) below the diagonal and a part 
hi{x) above the diagonal of A (see Fig. [1] right panel). 
The point x* is implicitly defined by hu{x*) — hi{x*), 
where the step-function h{x) intersects with the diago- 
nal. Let P{k) denote the stationary degree distribution. 
We have that hu{x) = ^ P{k)dk. From the station- 
ary solution of Eq. ^ we find hu{x) = 7Ve-2(i-2a)an^ 
with A/" = 2(1 - 2a)/{l - e-2(i-2«)Af). 





FIG. 2: (Left) Eigenvector centraUzation C„ in stationary net- 
works as a function of the link formation probability a for dif- 
ferent system sizes = 100 (o), TV = 1000 (□) and = 5000 
{()). Results of numerical simulations are superimposed with 
lines representing the analytical prediction. (Right) Degree 
distributions of stationary networks for different values of 
a = 0.45 (o), 0.48 (□), 0.49 (0), 0.495 (A) and system size 
= 5000. The figure reveals that the leading part of the 
distribution is exponential, while a logarithmic binning shows 
a power-law tail with exponent —1. 



This result for the functional form of the step-function 
is valid for the elements below the diagonal, i.e. for the 
nodes with low degree. We now turn our attention to the 
high degree, central nodes. From the symmetry of the ad- 
jacency matrix, one finds that hi(x) for these nodes satis- 
fies X = A/'e^^'^^^^"-''''^^-'. Thus, inverting this expression 
we get hi{x) = (ln(A/') - ln(x))/(2(l - 2a)). Conversely, 
the degree distribution is given by P{k) = —h'{l — k), 
from which the following stationary degree distribution 
is found 



(5) 



In particular, for a ~ 1/2, it results in a uniform dis- 
tribution P{k) = 1/N. Degree distributions for different 
values of a in the stationary state can be seen in Fig. [2l 
In these nested structures, the adjacency matrix is 
completely determined by the corresponding degree dis- 
tribution from Eq. ([S]) or, conversely, from the profile 
function h{x). Thus, it is possible to compute any net- 
work statistic of interest when the degree distribution 
is known. In doing so, one can show that the sta- 
tionary networks emerging in the link formation process 
are characterized by short path length, high clustering, 
negative degree-clustering correlations and dissortativity. 
The emerging networks also show a clear core-periphery 
structure, which can be measured by their centralization, 
defined as the normalized sum of differences between the 
centrality of the nodes and the maximum centrality in 
the network Q . Fig. [5] shows the eigenvector central- 
ization Ct,. It can be seen that there exists a transition 
at a = 1/2 from highly centralized to highly decentral- 
ized networks. This means that for low arrival rates of 
linking opportunities a (and a strong link decay) the sta- 
tionary network is strongly centralized, while for high ar- 
rival rates of linking opportunities, stationary networks 



4 



are dense and largely homogeneous. 

The symmetry condition for the step- function h{x) im- 
plies an important result when part of the degree dis- 
tribution (for example around the head, i.e. fc — !• 0) 
shows a power-law decay: The tail of the distribution 
(i.e. k — > oo) also follows a power-law distribution, but 
with a different exponent. To see this, let us assume 
that the head of the distribution has the functional de- 
pendence P{k) = pk^^ . If 77 > 0, this implies that 
the step- function hi{x) for low degree nodes is given by 
hi{x) = (3k~^~^ /I — rj. By inverting this function, we 
get X = [/3/(l — ?7)]/i„(a;)^/'~''+^^; and the distribution in 
the tail yields P„(fc) = [/3/(l - 77)]i/(''+i) (1 + jf)-^ l¥>^ , 
where ry^ = 77/(77 — 1). In the limit 77 — >■ 00, (there is an 
exponential distribution for the head), it implies 77^ — >■ 1, 
i.e. we recover the previous result of Eq. ([5|). The power- 
law distribution in the head and in the tail have the same 
exponent when 77 = 2. 

So far we have assumed that all nodes are selected at 
the same rate, regardless of their position in the net- 
work. Depending on the context, this assumption may 
not apply. In order to overcome this limitation, we as- 
sume that nodes are selected at a rate which depends 
on their position in the network. Note that the rate at 
which nodes are selected affects only the frequency but 
not the way in which they create or remove links. There- 
fore, the nestedness of the network is preserved. More- 
over, in these nested structures, the nodes with the same 
degree are indistinguishable, as only their degree rank 
in the network is important. We therefore assume that 
the node selection rate is a function of the degree of 
the node. As a simple example, we set F(fc) = fc** -I- A, 
where A > denotes the idiosyncratic activity of every 
node, and 77 > a parameter governing nonlincarly the 
preferential selection of nodes with higher degree. Using 
similar arguments as in the derivation of Eq. ([2|) , we can 
write the evolution of the degree distribution as follows, 

dtP(k-t) = il-^fc"-ip(fc;i) 

+ ^^~^"^ [fc" + A\ dkP{k; t)6k + 0{5e). 

In the continuous limit, the stationary solution is given 
by P{k) — D/{A + fc''), where D is a normalization con- 
stant such that P{k)dk ~ 1. The solution reduces to 
the exponential one when 77 — cxd and A N. In the 
general case, the degree distribution exhibits two differ- 
ent power-law behaviors and an inflection point. These 
two power-laws have the functional form P{k) ^ k^^ for 
the head of the distribution, and consequently P(fc) ~ 
k~n/{n-i) for the tail. 

To summarize, we have introduced a network forma- 
tion process in which link creation and removal is based 
on the position of the nodes in the network measured by 
their centrality. Interestingly, the network evolution is 
independent of the exact measure of centrality, and docs 
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FIG. 3: (Right) Degree distribution for different exponents 
in tlie head of the distribution rj = 1.2 (V), 1.5 (A), 2 (0), 
2.5 (□), 3 (o). If a nested network exhibits a power-law in 
the head (tail) of the degree distribution then the distribution 
will also exhibit a power-law behavior in the tail (head) , with 
an exponent that can be completely determined by the head 
(tail). (Left) Power-law exponents for the tail of the degree 
distribution, i.e. k ^ 00 (□), and the head of the distribution, 
i.e. fc — >■ (o), as a function of the power-law exponent of the 
head. The symbols correspond to networks of A*" = 10'' nodes, 
and the lines represent the numerical simulations. 



not require global information of the complete network 
structure. Also, extending the model to allow for het- 
erogeneous activity levels of nodes keeps this property 
unaltered, although the degree distribution is modified, 
and a restricted set of double power-law degree distribu- 
tions is found. 
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